Corpus: bos_wikipedia_2018_10K

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 4757 p-
2 3060 s-
3 2077 o-
4 1968 n-
5 1888 k-
Top Character Bigrams
word rank frequency n-gram
1 2054 pr-
2 1735 po-
3 1066 na-
4 772 ko-
5 725 za-
Top Character Trigrams
word rank frequency n-gram
1 649 pro-
2 630 pre-
3 549 pri-
4 269 pos-
5 244 kon-
Top Character 4-Grams
word rank frequency n-gram
1 147 pred-
2 112 post-
3 93 prim-
4 92 prot-
5 92 komp-
Top Character 5-Grams
word rank frequency n-gram
1 58 inter-
2 53 nasta-
3 52 elekt-
4 51 organ-
5 50 preds-
682 msec needed at 2019-02-19 08:13